CDS

Accession Number TCMCG075C23603
gbkey CDS
Protein Id XP_017981488.1
Location complement(join(3550910..3551167,3551305..3551431,3551565..3551681,3551908..3552064,3552176..3552317,3552408..3552498,3552644..3552750,3552844..3552999,3553197..3553373,3553497..3553638,3553869..3553988,3554101..3554220,3554337..3554396,3554826..3554947,3555075..3555116))
Gene LOC18591840
GeneID 18591840
Organism Theobroma cacao

Protein

Length 645aa
Molecule type protein
Topology linear
Data_file_division PLN
dblink BioProject:PRJNA341501
db_source XM_018125999.1
Definition PREDICTED: probable rhamnogalacturonate lyase B [Theobroma cacao]

EGGNOG-MAPPER Annotation

COG_category S
Description Rhamnogalacturonate lyase
KEGG_TC -
KEGG_Module -
KEGG_Reaction -
KEGG_rclass -
BRITE ko00000        [VIEW IN KEGG]
ko01000        [VIEW IN KEGG]
KEGG_ko ko:K18195        [VIEW IN KEGG]
EC 4.2.2.23        [VIEW IN KEGG]        [VIEW IN INGREDIENT]
KEGG_Pathway -
GOs -

Sequence

CDS:  
ATGTCATCTCGGGGAGTGCAGTTAGATGTCCAAGATAAATATGTGGTGATGGATAATGACATACTGCAGGTGACAATATCAAACCCGGGTGGGATTGTTACCGGTTTGCAATATAATGGCATCGACAATTTGCTTGCTGTTGAAAACGATGAAACTGATAGAGGGTATTGGGACCTTGTCTGGAATCTAGCAGGAAGGAAAGGAACAAAGGGTAAATTTGACAGGATTGAAGCAACAAGCTTTAATGTAATAGTGGAAAATGAGGAGCAGGTGGAGCTCTCATTTACTAGAATGTGGGATTCTTCCCTTGAAGGCAGAGTCGTTCCCTTGAATATTGACAAGAGGTTTGTAATGCTTCGTAATTCCTCAGGATTCTATACCTATGCAATTTACGAGCACTTGGAGGAATGGCCTGCTTTCATCCTTGACAGGCTCAGGGTCGCATTCAAGCTCAGGAAAGACAAGTTTCATTACATGGCTATGGCAGACAACAGGCAAAGATGCATGCCCCTGCCTGATGACCGGTTACCATACAGAAGCCAAACCCTGGCCTATCCCGAGGCAGTCCTACTCGTTGATCCACTGGAACCGGAGTTTAGAGGAGAGGTAGATGACAAGTACCAATACTCATGTGAGAACAAGGACAACAGGGTCCATGGTTGGATTTGCACTGACCCCCCCGTCCCCGTGGGGTTCTGGCAAATAACACCCAGCGATGAGTTTCGATCTGCTGGCCCCCTCAAGCAAAACCTTACCTCACATGTCGGCCCCACCACCCTTGCTGTTATGCACAGCGTTCATTATTCAGGAGAGGATTTGTTATTGAAATTTGGAACTAATGAGCCCTGGAAGAAAGTTTTTGGCCCTATTTTTATTTACCTTAATTCTTTGTCCAATGGAGGTGACCCACTTTCACTTTGGGAGGATGCTAAAGAACAGATGACAATAGAAGTACAAGGCTGGCCTTACAGTTTCCCTGTTTCTGAAGACTTTCCACAGTCTGATCAAAGGGGCAATGTCAGTGGTAGATTATTAGTGAAAGATAGATATGTTCACGATGACAACATACCTGCCAATGGTGCTTATGTGGGCTTAGCACCTCCAGGAGATGTTGGATCTTGGCAAAGAGAAGTCAAGGGTTATCAATTCTGGACCAAAACTGATGAAGATGGCTATTTCTGCATTAATAATATTCGGACCGGAGACTATAATCTATACGCATGGGTCCCTGGCTTCATTGGAGATTATCGATACGATGTTATCATTACACCAACTGCAGGCTATGATATTTATATGGGTGATCTTGTATATGAGCCTCCAAGAGATGGCCCTACATTGTGGGAAATAGGAATCCCAGATCGCACTGCTGCAGAATTCTATGTCCCTGATCCTAGTCCAGTATATATCAACAAGTTATATGTTAATCACCAAGACAGATTTAGGCAGTATGGATTATGGGAAAGATATGCAGAGTTATATCCTGATGGAGATTTGGTCTACACAGTTGGCGTTAGTGACTATACAAAAGACTGGTTTTTTGCGCAGGTTACCAGAAAGAAAGATGATAATACATACCAAGGAACTACATGGCAAATCAAGTTCAAACTAGACTGTGTAGATGAGAGCGGAACATACAAACTGCGATTGGCATTGGCTTCTGCACATGCTTCTGAACTTCAGGTACGGGTCAATGATCCTGATGCAATTTCTCCTCTGTTTTCGAGTGGTCAAATTGGCAAGGACAACACAATTGCGAGACATGGAATTCATGGGCTCTACTGGCTGTACAATGTGGATATTCCAGGAAATCTGCTCCATGAAGGAGATAATACCATTTTTCTAACGCAATCAAAAAGCACTAGCCCTTTCCAGGGAATTATGTATGACTATATTCGTCTAGAAGGCCCTCCATCTTCTGACGCCAACAAGAGACCTTAG
Protein:  
MSSRGVQLDVQDKYVVMDNDILQVTISNPGGIVTGLQYNGIDNLLAVENDETDRGYWDLVWNLAGRKGTKGKFDRIEATSFNVIVENEEQVELSFTRMWDSSLEGRVVPLNIDKRFVMLRNSSGFYTYAIYEHLEEWPAFILDRLRVAFKLRKDKFHYMAMADNRQRCMPLPDDRLPYRSQTLAYPEAVLLVDPLEPEFRGEVDDKYQYSCENKDNRVHGWICTDPPVPVGFWQITPSDEFRSAGPLKQNLTSHVGPTTLAVMHSVHYSGEDLLLKFGTNEPWKKVFGPIFIYLNSLSNGGDPLSLWEDAKEQMTIEVQGWPYSFPVSEDFPQSDQRGNVSGRLLVKDRYVHDDNIPANGAYVGLAPPGDVGSWQREVKGYQFWTKTDEDGYFCINNIRTGDYNLYAWVPGFIGDYRYDVIITPTAGYDIYMGDLVYEPPRDGPTLWEIGIPDRTAAEFYVPDPSPVYINKLYVNHQDRFRQYGLWERYAELYPDGDLVYTVGVSDYTKDWFFAQVTRKKDDNTYQGTTWQIKFKLDCVDESGTYKLRLALASAHASELQVRVNDPDAISPLFSSGQIGKDNTIARHGIHGLYWLYNVDIPGNLLHEGDNTIFLTQSKSTSPFQGIMYDYIRLEGPPSSDANKRP